AD-A223  636 


REPORT  DOCUMENTATION  PAGE 


Form Approved
OMB No. 0704-0188

2. REPORT DATE    3. REPORT TYPE AND DATES COVERED

Final Report, 1 Oct 87 to 30 Sep 89


4. TITLE AND SUBTITLE

USE OF AN INDEFINITE INNER PRODUCT FOR COMPUTING DAMPED NATURAL MODES

5. FUNDING NUMBERS

AFOSR-88-0048
61102F 2304/A3

6. AUTHOR(S)

B.  N.  Parlett  and  H.  C.  Chen 


7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES)

University  of  California 
Department  of  Mathematics 
Berkeley,  CA  94720 


8. PERFORMING ORGANIZATION REPORT NUMBER


AFOSR-TR-  UII  07  3  4 


9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES)

AFOSR/NM 
Building  410 

Bolling  AFB,  DC  20332-6448 


10. SPONSORING/MONITORING AGENCY REPORT NUMBER


AFOSR-88-0048 


12a. DISTRIBUTION/AVAILABILITY STATEMENT

Approved for public release; distribution unlimited.


13. ABSTRACT (Maximum 200 words)

A quadratic eigenvalue problem with symmetric positive definite coefficient
matrices may be reduced to linear form while retaining symmetry in the new
coefficients, but neither of them will be positive definite. Formally the
symmetric Lanczos algorithm and subspace iteration may be used to compute some
eigenpairs of the linear problem. The trouble is that the basis vectors are
orthogonal with respect to an indefinite inner product, so there is no
assurance that they will be linearly independent. Nevertheless this is an
attractive way to solve the original problem and we discuss how to implement it
and how it relates to the unsymmetric Lanczos procedures. We discuss complex
origin shifts, reorthogonalization, and error bounds. Several methods for
solving the reduced problem are mentioned but we have no fully satisfying
technique. Some dangers are described and examples are given comparing our
Lanczos program with a modified subspace iteration.


1. INTRODUCTION


In the study of mechanics, when one analyzes how small displacements from an equilibrium state
evolve in time, one is led to the familiar equations of motion

$$ M \ddot{q}(t) + C \dot{q}(t) + K q(t) = f(t) \tag{1.1} $$

where $M$, $C$, and $K$ are, respectively, the $n \times n$ mass, damping, and stiffness matrices and
$\ddot{q}(t)$, $\dot{q}(t)$, and $q(t)$ are the $n \times 1$ acceleration, velocity, and displacement vectors.

For  many  solid  structures  attached  to  the  earth  the  damping  coefficients  are  small  compared  with 
those  of  mass  and  stiffness.  For  structures  in  space  damping  plays  a  bigger  role  and  the  undamped  model 
may  not  be  a  good  approximation.  To  understand  the  response  of  the  system  to  a  variety  of  external 
forces $f(t)$ it is still useful to know the system's dominant natural modes of vibration. This leads to the
quadratic eigenvalue problem

$$ (\lambda_j^2 M + \lambda_j C + K)\, w_j = 0, \qquad j = 1, \ldots, 2n \tag{1.2} $$

In general $\lambda_j$ will be complex and the associated modal shapes are given by the real and imaginary parts
of $w_j$. In the problems we consider $M$, $C$, $K$ are real, symmetric, and positive definite.

Our paper has several goals. In Section 2 we make a plea for keeping symmetry explicit in the
reduction to linear form even though the resulting pencil (or pair of matrices) is not definite. We are
certainly not the first to make this suggestion. In Section 3 we show how to make a complex shift of origin
and yet keep the eigenvalue algorithm confined to real arithmetic. This idea appears not to have been
used before. Sections 4 and 5 are theoretical; they show important connections but do not contain new
results. After that we describe an implementation of a Lanczos algorithm for symmetric indefinite
pencils. It is designed to exploit any small bandwidth in $M$, $C$, $K$. We describe how selective
orthogonalization can be carried over from the definite case (Section 6), the numerical dangers that beset
the algorithm in our case (Section 7), the solution of the reduced eigenvalue problem (Section 8), and how
to compute accurate error estimates at very low cost at any step of the algorithm (Section 9). Finally, some
numerical results and comparisons make up Section 10.

We mention here that certain problems need more study. (1) When $M$ is singular, or nearly so, the
computed Lanczos vectors can acquire large components in $M$'s null space without the algorithm being
able to detect that this is happening. We have a remedy but do not know how completely it cures the
problem. (2) The efficient solution of the reduced eigenvalue problem is an interesting challenge.
However, in the context of the whole algorithm, it is satisfactory to employ a standard, robust algorithm that
does not exploit all the features of the reduced problem. A Lanczos program similar to ours has been
developed independently (see [Nour-Omid & Regelbrugge, 1988]) and used effectively to solve the
equations of motion (1.1) directly without computing eigensolutions.

We follow standard notational conventions in the field of matrix computations. In particular, capital
letters denote matrices, small Roman letters denote column vectors, and small Greek letters denote
scalars. Also $v'$ denotes the transpose of $v$ and $\bar{\alpha}$ denotes the conjugate of a complex number $\alpha$. Unless
the contrary is indicated $\|x\|$ denotes the Euclidean norm of $x$, and $\|B\|$ denotes the spectral norm of $B$.

We suggest that this reduction to standard form, either to (2.3) or (2.5), is a tactical error.

In  order  to  explain  our  view  the  following  standard  terminology  will  be  needed.  A  symmetric 
matrix  is  indefinite  if  it  has  eigenvalues  of  both  signs. 

Definition 1. Let $Y \in \mathbb{R}^{n \times n}$ be symmetric but indefinite. The bilinear form defined by

$$ (u, v)_Y := v' Y u $$

is called a pseudo (or indefinite or improper) inner product. It obeys all the axioms of an inner
product except positivity.

Definition 2. Let $F \in \mathbb{R}^{n \times n}$, $Y \in \mathbb{R}^{n \times n}$, with $Y$ symmetric.

$F$ is symmetric with respect to $Y$ if

$$ F' Y = (Y F)' = Y F, $$

or, equivalently, if

$$ (u, F v)_Y = (F u, v)_Y $$

for all $u \in \mathbb{R}^n$, $v \in \mathbb{R}^n$.

In  some  contexts  one  says  that  F  is  self-adjoint  in  Y’s  (pseudo)  inner  product. 

The trouble with (2.2) and (2.3) is that neither of them reminds us that

$$ \begin{bmatrix} -K^{-1} C & -K^{-1} M \\ I & 0 \end{bmatrix} $$

is symmetric with respect to both

$$ \begin{bmatrix} C & M \\ M & 0 \end{bmatrix} \quad \text{and} \quad \begin{bmatrix} -K & 0 \\ 0 & M \end{bmatrix}. $$

Similarly the first matrix in (2.5) is symmetric with respect to

$$ \begin{bmatrix} -I & 0 \\ 0 & I \end{bmatrix}. $$

We see no gain in efficiency in going all the way to (2.5) in the reduction but circumstances may change.


In brief, (2.1) suggests the relevant symmetries but (2.2) does not. (2.1) is a special case of the
problem

$$ \left( A - \frac{1}{\lambda} B \right) z = 0 $$

where $A$ and $B$ are symmetric but indefinite. A matrix pair $(A, B)$ is sometimes called a matrix pencil.
See [Gantmacher, 1959] for example. In our case we have a symmetric indefinite pencil. Excellent
references for further study of these pencils are [Gohberg, Lancaster, and Rodman, 1983 and 1986].

Lemma. $B^{-1} A$ is symmetric with respect to both $A$ and $B$. $A B^{-1}$ is symmetric with respect to
both $A^{-1}$ and $B^{-1}$.

Proof: For all $u, v$ in $\mathbb{R}^n$,

$$ (u, B^{-1} A v)_A = v' A B^{-1} A u = (B^{-1} A u, v)_A, $$

$$ (u, B^{-1} A v)_B = v' A u = (B^{-1} A u, v)_B. $$

The second result can be obtained similarly. □
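The lemma can be checked numerically on a small example. The sketch below is illustrative only: the matrices `M`, `C`, `K` are assumed values (not from the report), and the pencil is taken as $A = \begin{bmatrix} C & M \\ M & 0 \end{bmatrix}$, $B = \operatorname{diag}(-K, M)$ with $\lambda A z = B z$ and $z = (w, \lambda w)$, as the surrounding text indicates. It uses the fact that $F$ is symmetric with respect to $Y$ exactly when $Y F$ is a symmetric matrix.

```python
import numpy as np

# Small illustrative mass, damping, and stiffness matrices (assumed values).
M = np.diag([1.0, 2.0, 3.0])
K = np.array([[2.0, -1.0, 0.0],
              [-1.0, 2.0, -1.0],
              [0.0, -1.0, 2.0]])
C = 0.1 * K + 0.05 * M              # symmetric positive definite damping

n = M.shape[0]
Z = np.zeros((n, n))

# Symmetric indefinite pencil: lambda * A z = B z with z = [w; lambda * w].
A = np.block([[C, M], [M, Z]])
B = np.block([[-K, Z], [Z, M]])

F = np.linalg.solve(B, A)           # F = B^{-1} A

# F is symmetric with respect to Y exactly when Y F is a symmetric matrix.
sym_wrt_A = np.allclose(A @ F, (A @ F).T)
sym_wrt_B = np.allclose(B @ F, (B @ F).T)

# The eigenvalues of A^{-1} B are the lambda of the quadratic problem, and the
# leading n components of each eigenvector give the mode w.
lam, vecs = np.linalg.eig(np.linalg.solve(A, B))
residuals = [np.linalg.norm((l**2 * M + l * C + K) @ z[:n])
             for l, z in zip(lam, vecs.T)]
max_residual = max(residuals)
```

A dense eigensolver is used here only to confirm the algebra; it is not the Lanczos procedure the report describes.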

It turns out that the Lanczos algorithm may be invoked with the operator $B^{-1} A$ using a pseudo inner
product defined by $A$ or $B$. The three term recurrence still holds in this more general situation. There is
more on the Lanczos algorithm in Section 4.

Subspace Iteration [Bathe and Wilson, 1976] or [Rutishauser, 1970] (also called the method of
Simultaneous Iterations) also extends formally to the indefinite case but the Rayleigh-Ritz approximations
produced at each step are no longer optimal in any meaningful sense. We have made the necessary
modification to our standard SI program and use it as a simple rival to our Lanczos procedure.

In contrast to the definite case both algorithms can break down or become unstable when close to
breakdown. Alarming things can happen with an indefinite (or improper) inner product: a set of
orthogonal vectors might be linearly dependent. The geometry associated with such an improper inner product is
the geometry of relativity theory.
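A two-dimensional illustration of this phenomenon, as a minimal sketch with an assumed matrix `Y = diag(1, -1)`: the vector `u` below is orthogonal to itself in the pseudo inner product, so `u` and any multiple of it form an "orthogonal" set that is linearly dependent.

```python
import numpy as np

Y = np.diag([1.0, -1.0])            # symmetric and indefinite (assumed example)

def pseudo_inner(u, v, Y):
    """Pseudo inner product (u, v)_Y = v' Y u."""
    return v @ Y @ u

u = np.array([1.0, 1.0])            # (u, u)_Y = 1 - 1 = 0
v = 2.0 * u                         # a multiple of u

self_product = pseudo_inner(u, u, Y)
cross_product = pseudo_inner(u, v, Y)
rank = np.linalg.matrix_rank(np.column_stack([u, v]))
```

Both products vanish while the rank is 1, which is impossible for a genuine inner product.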

Next we wish to point out that it is possible to work with $A B^{-1}$ instead of $B^{-1} A$ but we see no
advantage to this formulation in our problem. Since $A B^{-1}$ is symmetric with respect to $B^{-1}$ and $A^{-1}$ we
must work with these pseudo inner products. Now $A^{-1}$ is complicated but when $B = \operatorname{diag}(-K, M)$ then
we can form the Cholesky factorization of $M$ and $K$ once for all. After that the computation of $B^{-1} v$
costs no more than the computation of $B v$. The need for shifts of origin (see next section) suggests the
use of an operator $H^{-1} A$ and the pseudo inner product defined by $A$. Here $H$ may be $B$ or $B - \sigma A$ or
$\operatorname{Re}(B - \sigma A)$ or $\operatorname{Im}(B - \sigma A)$.

Both forms (2.1) and (2.2) are well known. In a recent paper [Born & Mantegazza 1977] form (2.1)
is explicitly proposed when $M$, $C$, and $K$ are all symmetric. However, no mention is made of the fact that
the Rayleigh quotients used in that paper may overflow or yield 0/0. A conventional alternative to the
pseudo-symmetric form (2.1) or (2.4) is to apply the two sided Lanczos algorithm to the matrix in (2.3) or
(2.5). In Section 4 we show the connection between these two approaches.

We recall that the attraction of the Lanczos algorithm is that a few of the largest eigenvalues may be
found by stopping the algorithm long before the full $n$ steps. It is usual practice to give the algorithm an
operator such as $H^{-1} A$ whose largest eigenvalues are shifted reciprocals of the ones we really want. We
can expect to find the $p$ largest eigenvalues within $\max(p+8, 2p)$ steps. See Section 10 for examples.
By reducing the $n \times n$ quadratic problem to linear form the dimension of the Lanczos vectors becomes $2n$.
However the cost of implementing the Lanczos algorithm is only doubled (approximately) because the
structure of the matrices $A$ and $B$ may be exploited. The algorithm and operation counts are given in
Figure 1.

Note that we could have used the algorithm for the operator $H^{-1} A$ using the pseudo-inner product
defined by $H$. The variation alters the subroutines supplied to the Lanczos algorithm, not the program
itself. There may be advantages to the $H$ inner product but we have not studied the matter since $H$ may
be complicated.


3.  ORIGIN  SHIFTS 

There  are  applications  (e.g.  space  structures)  in  which  the  stiffness  matrix  K  is  singular.  This 
makes  B  =  diag  (-K,  M)  singular  too.  In  this  situation,  we  need  to  solve  a  shifted  problem 

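$$ (\lambda - \sigma) A z = (B - \sigma A) z $$

where $\sigma \neq 0$ is a real shift. To preserve the block diagonal form, we note that $B - \sigma A$ can be factored
into

$$ B - \sigma A = \begin{bmatrix} I & -\sigma I \\ 0 & I \end{bmatrix} \begin{bmatrix} -(K + \sigma C + \sigma^2 M) & 0 \\ 0 & M \end{bmatrix} \begin{bmatrix} I & 0 \\ -\sigma I & I \end{bmatrix}. \tag{3.1} $$

We need to factor only $M$ and the shifted stiffness matrix $K + \sigma C + \sigma^2 M$, which has the same banded
structure as $K$, in order to solve $(B - \sigma A) u = A v$ for $u$ given $v$. Thus we may solve

$$ (B - \sigma A)^{-1} A z = \frac{1}{\lambda - \sigma}\, z $$

to obtain eigenvalues close to $\sigma$. This is standard practice.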
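The block factorization of $B - \sigma A$ can be verified directly, and it shows why only $M$ and $K + \sigma C + \sigma^2 M$ need to be factored. The following sketch uses assumed example matrices and an assumed shift `sigma`; the helper `shifted_solve` is a hypothetical name, and `np.linalg.solve` stands in for the banded Cholesky solves a real implementation would use.

```python
import numpy as np

M = np.diag([1.0, 2.0, 3.0])
K = np.array([[2.0, -1.0, 0.0],
              [-1.0, 2.0, -1.0],
              [0.0, -1.0, 2.0]])
C = 0.1 * K + 0.05 * M
n = M.shape[0]
I, Z = np.eye(n), np.zeros((n, n))
A = np.block([[C, M], [M, Z]])
B = np.block([[-K, Z], [Z, M]])

sigma = -0.3                                  # real shift (assumed value)
S = K + sigma * C + sigma**2 * M              # shifted stiffness matrix

# The three factors of B - sigma * A.
left = np.block([[I, -sigma * I], [Z, I]])
middle = np.block([[-S, Z], [Z, M]])
right = np.block([[I, Z], [-sigma * I, I]])
factorization_ok = np.allclose(left @ middle @ right, B - sigma * A)

def shifted_solve(v):
    """Solve (B - sigma*A) u = A v using only solves with S and M."""
    b = A @ v
    b1, b2 = b[:n], b[n:]
    y1 = b1 + sigma * b2                      # inverse of the left factor
    x1 = np.linalg.solve(-S, y1)              # block diagonal middle factor
    x2 = np.linalg.solve(M, b2)
    return np.concatenate([x1, x2 + sigma * x1])   # inverse of the right factor

v = np.arange(1.0, 2 * n + 1)
u = shifted_solve(v)
solve_ok = np.allclose((B - sigma * A) @ u, A @ v)
```

The two outer factors are inverted by a sign change of the off-diagonal block, so they cost only vector additions.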

In several applications the user would like to explore part of the complex plane and see how certain
eigenvalues vary with changes in the matrix elements. Unfortunately, if $\sigma$ is complex then $B - \sigma A$ is not
real and so $(B - \sigma A)^{-1} A$ is not a valid operator for our Lanczos algorithm.

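A way out of this quandary was proposed in [Parlett and Saad, 1987]. The operator given to the
Lanczos algorithm is either

$$ \operatorname{Re}\left[ (B - \sigma A)^{-1} \right] A \quad \text{or} \quad \operatorname{Im}\left[ (B - \sigma A)^{-1} \right] A. \tag{3.2} $$

Here $\operatorname{Re} G := \tfrac{1}{2}(G + \bar{G})$, $\operatorname{Im} G := \tfrac{1}{2i}(G - \bar{G})$, and $\bar{G}$ is the conjugate of $G$. Both operators are real
and have complex conjugate pairs of eigenvalues.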

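Let $\lambda$, $z$ be an eigenpair of $(B, A)$, i.e. they satisfy $\lambda A z = B z$. By subtracting $\sigma A z$ and $\bar{\sigma} A z$
from each side and inverting one finds that

$$ \operatorname{Re}\left[ (B - \sigma A)^{-1} \right] A z = \frac{1}{2} \left( \frac{1}{\lambda - \sigma} + \frac{1}{\lambda - \bar{\sigma}} \right) z, \tag{3.3a} $$

$$ \operatorname{Im}\left[ (B - \sigma A)^{-1} \right] A z = \frac{1}{2i} \left( \frac{1}{\lambda - \sigma} - \frac{1}{\lambda - \bar{\sigma}} \right) z. \tag{3.3b} $$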

To recover $\lambda$ from the computed eigenvalues of either of these operators requires merely solving a
quadratic equation for the root that is closer to $\sigma$. However $\lambda$ may be recovered directly, with extra work,
from the Rayleigh quotient

$$ \lambda = z' B z / z' A z. $$
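Relations (3.3a) and (3.3b) and the Rayleigh quotient can be checked on a small dense example. This is a sketch under assumptions: the matrices and the shift `sigma` are invented for illustration, a dense inverse replaces the banded complex factorization, and the reference eigenpairs come from a dense eigensolver rather than from the Lanczos algorithm. The Rayleigh quotient uses the plain transpose $z'$ (no conjugation), which is what `z @ B @ z` computes for a complex vector.

```python
import numpy as np

M = np.diag([1.0, 2.0, 3.0])
K = np.array([[2.0, -1.0, 0.0],
              [-1.0, 2.0, -1.0],
              [0.0, -1.0, 2.0]])
C = 0.1 * K + 0.05 * M
n = M.shape[0]
Z = np.zeros((n, n))
A = np.block([[C, M], [M, Z]])
B = np.block([[-K, Z], [Z, M]])

sigma = -0.1 + 1.0j                        # complex shift (assumed value)
G = np.linalg.inv(B - sigma * A)           # dense inverse, for illustration only
re_op = G.real @ A                         # Re[(B - sigma A)^{-1}] A, a real matrix
im_op = G.imag @ A                         # Im[(B - sigma A)^{-1}] A, a real matrix

# Reference eigenpairs of lambda * A z = B z.
lam, vecs = np.linalg.eig(np.linalg.solve(A, B))

max_err = 0.0
for l, z in zip(lam, vecs.T):
    a, b = 1.0 / (l - sigma), 1.0 / (l - np.conj(sigma))
    theta_re = 0.5 * (a + b)               # eigenvalue predicted by (3.3a)
    theta_im = (a - b) / 2j                # eigenvalue predicted by (3.3b)
    max_err = max(max_err,
                  np.linalg.norm(re_op @ z - theta_re * z),
                  np.linalg.norm(im_op @ z - theta_im * z))

# Recover lambda from each eigenvector with the (transpose) Rayleigh quotient.
rayleigh = np.array([(z @ B @ z) / (z @ A @ z) for z in vecs.T])
rayleigh_err = np.max(np.abs(rayleigh - lam))
```

Both operators are real matrices even though the shift is complex, which is the point of the construction.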

In order to exploit narrow bandwidth it is convenient to use complex arithmetic in (3.1) to factor
$B - \sigma A$ and thus to solve, for $w \in \mathbb{C}^n$, $(B - \sigma A) w = A v$ for any given $v \in \mathbb{R}^n$. However, the
subroutine returns either $\operatorname{Re} w$ or $\operatorname{Im} w$ and so there is no complex arithmetic in the Lanczos algorithm. The
major computation is in factoring $K + \sigma C + \sigma^2 M$. This is done once. Note that the pseudo-inner product
is defined by $A$ and is independent of $\sigma$.

Both the real and imaginary part of $(B - \sigma A)^{-1}$ provide good enhancement of the eigenvalues close
to $\sigma$. It is shown in [Parlett and Saad, 1987] that the imaginary part suppresses the unwanted eigenvalues
far from $\sigma$ more strongly than does the real part. More experience is needed on this aspect of the shift.

4.  CONNECTION  BETWEEN  LANCZOS  ALGORITHMS 

The original formulations given by Lanczos (in [Lanczos, 1950]) considered the standard symmetric
eigenvalue problem $A z = \lambda z$, i.e. $B = I$. The Lanczos vectors $\{q_k\}$ for symmetric $n \times n$ $A$ obey the
well-known 3-term recurrence relation

$$ A q_k = \beta_k q_{k-1} + \alpha_k q_k + \beta_{k+1} q_{k+1}, \qquad k = 1, \ldots, n \tag{4.1} $$

with $q_0 = 0$. The $\{q_k\}_{k=1}^{n}$ form an orthonormal set. The attractive feature of the Lanczos algorithm for
the generalized problem $A - \lambda B$, where $B$ is symmetric, positive definite, is that, as in (4.1),

$$ B^{-1} A q_k = \beta_k q_{k-1} + \alpha_k q_k + \beta_{k+1} q_{k+1}. $$

The Lanczos vectors are still orthonormal but in the inner product defined by $B$. Usually they will not be
orthogonal in the ordinary Euclidean sense.

In general, for a nonsymmetric matrix $F$ (we rename $B^{-1} A$ as $F$) there is a two-sided generalization
(also given by Lanczos) in which two sets $\{p_1, \ldots, p_n\}$ and $\{q_1, \ldots, q_n\}$ are generated. Moreover, if
$p_k' q_k = 1$, then

$$ F q_k = \beta_k q_{k-1} + \alpha_k q_k + \gamma_{k+1} q_{k+1}, $$

$$ F' p_k = \gamma_k p_{k-1} + \alpha_k p_k + \beta_{k+1} p_{k+1}, \tag{4.2} $$

and the $\{p_k\}$, $\{q_k\}$ are biorthogonal,

$$ p_i' q_k = 0, \qquad i \neq k. $$

This algorithm will break down when $p_k' q_k = 0$ with $p_k \neq 0$, $q_k \neq 0$. We can expect inaccuracy when
$p_k' q_k$ is very small. There is more on this point in Section 5.

For theoretical purposes it is convenient to normalize the Lanczos vectors by

$$ p_k' q_k = 1, \qquad \|p_k\| = \|q_k\| \tag{4.3} $$

provided breakdown does not occur. However, in practice, it is better to keep

$$ \|p_k\| = \|q_k\| = 1 $$

and define

$$ \omega_k = p_k' q_k. \tag{4.4} $$

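If $\omega_k$ is tiny ($< 10^{-8}$ or $\epsilon^{1/2}$) then the $\{p_k\}$ and $\{q_k\}$ are nearly linearly dependent and consequently
make bad bases in which to represent the solutions. The normalization (4.3) conceals any deterioration in
the quality of the basis $\{q_1, \ldots, q_k\}$. The normalization (4.4) makes the three term recurrence slightly
more complicated. Define

$$ J_n = \begin{bmatrix} \alpha_1 & \gamma_2 & & & \\ \beta_2 & \alpha_2 & \gamma_3 & & \\ & \ddots & \ddots & \ddots & \\ & & \beta_{n-1} & \alpha_{n-1} & \gamma_n \\ & & & \beta_n & \alpha_n \end{bmatrix} = P_n' F Q_n, $$

$$ \Omega_n = \operatorname{diag}(\omega_1, \ldots, \omega_n) = P_n' Q_n, \qquad Q_n = (q_1, \ldots, q_n), \qquad P_n = (p_1, \ldots, p_n); \tag{4.5} $$

then, in matrix form, the three term recurrence is

$$ F Q_n - Q_n \Omega_n^{-1} J_n = 0, \qquad P_n' F - J_n \Omega_n^{-1} P_n' = 0. \tag{4.6} $$

Note that when $F = A = A'$ and $B = I$ then $P_n = Q_n$, $\Omega_n = I$, and $J_n = T_n = T_n'$.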


In our pseudo-symmetric eigenvalue problem $A - \lambda B$ we face the same possibility as in (4.2) that
the Lanczos process may break down and may be unstable when close to breakdown. To distinguish this
algorithm from the two-sided algorithm given by (4.5) and (4.6) we label the Lanczos vectors in our
algorithm as $\hat{q}_1, \ldots, \hat{q}_n$. It is important to know when this instability occurs and so we prefer to normalize
the $\hat{q}_k$ to satisfy

$$ \|\hat{q}_k\| = 1 \quad \text{for all } k, \tag{4.7} $$

and then define

$$ \hat{\omega}_k = (\hat{q}_k, \hat{q}_k)_A. $$

A tiny value of $\hat{\omega}_k$ is a sign of danger. The three term recurrence, in matrix form, is

$$ B^{-1} A \hat{Q}_n - \hat{Q}_n \hat{\Omega}_n^{-1} \hat{T}_n = 0, \qquad \hat{T}_n = \hat{Q}_n' A B^{-1} A \hat{Q}_n, \qquad \hat{\Omega}_n = \hat{Q}_n' A \hat{Q}_n, \tag{4.8} $$

provided that breakdown does not occur. Note that $\hat{T}_n$ is symmetric. Equating the $k$th column on each
side of (4.8) yields the three term recurrence for our pseudo-symmetric formulation

$$ B^{-1} A \hat{q}_k = (\hat{\beta}_k / \hat{\omega}_{k-1})\, \hat{q}_{k-1} + (\hat{\alpha}_k / \hat{\omega}_k)\, \hat{q}_k + (\hat{\beta}_{k+1} / \hat{\omega}_{k+1})\, \hat{q}_{k+1}, \qquad k < n. \tag{4.9} $$

This equation shows the possible danger of small values among the $\{\hat{\omega}_i\}$.
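The recurrence (4.9) translates into a short loop. The sketch below is a minimal illustration, not the report's program: the function name `pseudo_lanczos`, the example matrices, the starting vector, and the breakdown threshold are all assumptions, and there is no reorthogonalization. It keeps unit-length vectors, records $\omega_k = q_k' A q_k$, and uses the fact that the norm of the residual equals $\beta_{k+1}/\omega_{k+1}$.

```python
import numpy as np

def pseudo_lanczos(apply_F, A, q1, m):
    """m steps of recurrence (4.9) in the pseudo inner product defined by A."""
    n = q1.size
    Q, omega, T = np.zeros((n, m)), np.zeros(m), np.zeros((m, m))
    q, q_old = q1 / np.linalg.norm(q1), np.zeros(n)
    beta, omega_old = 0.0, 1.0
    for k in range(m):
        Q[:, k] = q
        omega[k] = q @ A @ q
        if abs(omega[k]) < 1e-8:                 # assumed threshold
            raise ArithmeticError("near breakdown: omega is tiny")
        Fq = apply_F(q)
        alpha = q @ A @ Fq                       # alpha_k = (q_k, F q_k)_A
        T[k, k] = alpha
        r = Fq - (alpha / omega[k]) * q - (beta / omega_old) * q_old
        norm_r = np.linalg.norm(r)
        q_old, omega_old = q, omega[k]
        q = r / norm_r                           # next unit-length vector
        beta = norm_r * (q @ A @ q)              # beta_{k+1} = ||r|| * omega_{k+1}
        if k + 1 < m:
            T[k, k + 1] = T[k + 1, k] = beta
    return Q, omega, T, q, norm_r

# Illustrative data (assumed values).
M = np.diag([1.0, 2.0, 3.0])
K = np.array([[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]])
C = 0.1 * K + 0.05 * M
Z = np.zeros((3, 3))
A = np.block([[C, M], [M, Z]])
B = np.block([[-K, Z], [Z, M]])
F = np.linalg.solve(B, A)

Q, omega, T, q_next, norm_r = pseudo_lanczos(lambda v: F @ v, A,
                                             np.arange(1.0, 7.0), 4)

# Matrix form of the recurrence: F Q - Q Omega^{-1} T vanishes except in its
# last column, which holds the residual norm_r * q_next.
R = F @ Q - Q @ np.diag(1.0 / omega) @ T
recurrence_error = max(np.linalg.norm(R[:, :-1]),
                       np.linalg.norm(R[:, -1] - norm_r * q_next))
orth_error = np.linalg.norm(Q.T @ A @ Q - np.diag(omega))
```

In a production code `apply_F` would perform the banded solves of Section 3 instead of multiplying by a dense matrix.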

The goal of the preceding discussion was to prepare for the interesting, but natural, result that our
pseudo-symmetric procedure (4.8) and (4.9) is a disguised form of the two-sided algorithm (4.6). By
choosing $p_1$ appropriately it turns out that $p_k$ is a multiple of $A q_k$, for all $k$, and so there is no need to
hold the $p_k$ explicitly in memory.

Theorem. Suppose that $A$ and $B$ are real and symmetric. Suppose that no breakdown occurs. Let
the two-sided Lanczos algorithm in (4.6) be applied with operator $B^{-1} A$ (starting with $p_1$ and $q_1$) to
produce two biorthogonal sequences $\{p_k\}_{k=1}^{n}$, $\{q_k\}_{k=1}^{n}$. Let the pseudo-symmetric Lanczos procedure in
(4.9) be applied with $B^{-1} A$ (starting with $\hat{q}_1$) with pseudo-inner product defined by $A$ to produce $\{\hat{q}_k\}_{k=1}^{n}$.
If $\hat{q}_1 = q_1$ and $p_1 = A q_1 / \|A q_1\|$ then $\hat{q}_k = q_k$ and $p_k = A q_k / \|A q_k\|$, $k = 2, \ldots, n$.

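Proof: We use induction and let $u \parallel v$ mean that $u$ is a nonzero multiple of $v$. Put $k = 1$ in (4.9) and
equate column 1 on each side of (4.6) to find

$$ \hat{q}_2 \parallel B^{-1} A \hat{q}_1 - \hat{q}_1 (\hat{\alpha}_1 / \hat{\omega}_1), $$

$$ q_2 \parallel B^{-1} A q_1 - q_1 (\alpha_1 / \omega_1), $$

$$ p_2 \parallel A B^{-1} p_1 - p_1 (\alpha_1 / \omega_1). $$

By choice $\hat{q}_1 = q_1$ and $A q_1 = p_1 \|A q_1\|$, so

$$ \hat{\omega}_1 := (\hat{q}_1, \hat{q}_1)_A = (q_1, q_1)_A = q_1' p_1 \|A q_1\| = \omega_1 \|A q_1\|, $$

$$ \hat{\alpha}_1 := (\hat{q}_1, B^{-1} A \hat{q}_1)_A = (q_1, B^{-1} A q_1)_A = p_1' B^{-1} A q_1 \|A q_1\| = \alpha_1 \|A q_1\|. $$

Thus $\hat{\alpha}_1 / \hat{\omega}_1 = \alpha_1 / \omega_1$ and hence $\hat{q}_2 = q_2$ and $p_2 \parallel A q_2 / \|A q_2\|$. So $p_2 = A q_2 / \|A q_2\|$.

The induction assumption is that the theorem holds for $k = j - 1$ and $j$. Put $k = j$ in (4.9) and
equate column $j$ on each side of (4.6) to find

$$ \hat{q}_{j+1} \parallel B^{-1} A \hat{q}_j - \hat{q}_j (\hat{\alpha}_j / \hat{\omega}_j) - \hat{q}_{j-1} (\hat{\beta}_j / \hat{\omega}_{j-1}), $$

$$ q_{j+1} \parallel B^{-1} A q_j - q_j (\alpha_j / \omega_j) - q_{j-1} (\gamma_j / \omega_{j-1}), $$

$$ p_{j+1} \parallel A B^{-1} p_j - p_j (\alpha_j / \omega_j) - p_{j-1} (\beta_j / \omega_{j-1}). $$

Use the induction assumption to verify that

$$ \hat{\omega}_{j-1} := (\hat{q}_{j-1}, \hat{q}_{j-1})_A = (q_{j-1}, q_{j-1})_A = q_{j-1}' p_{j-1} \|A q_{j-1}\| = \omega_{j-1} \|A q_{j-1}\|, $$

$$ \hat{\beta}_j := (\hat{q}_{j-1}, B^{-1} A \hat{q}_j)_A = (q_{j-1}, B^{-1} A q_j)_A = p_{j-1}' B^{-1} A q_j \|A q_{j-1}\| = \gamma_j \|A q_{j-1}\|, $$

$$ \hat{\omega}_j := (\hat{q}_j, \hat{q}_j)_A = (q_j, q_j)_A = q_j' p_j \|A q_j\| = \omega_j \|A q_j\|, $$

$$ \hat{\alpha}_j := (\hat{q}_j, B^{-1} A \hat{q}_j)_A = (q_j, B^{-1} A q_j)_A = p_j' B^{-1} A q_j \|A q_j\| = \alpha_j \|A q_j\|, $$

$$ \gamma_j := p_{j-1}' B^{-1} A q_j = \frac{1}{\|A q_{j-1}\|}\, q_{j-1}' A B^{-1} A q_j = \frac{\|A q_j\|}{\|A q_{j-1}\|}\, q_{j-1}' A B^{-1} p_j = \frac{\|A q_j\|}{\|A q_{j-1}\|}\, \beta_j. $$

Using these relations we verify that

$$ \hat{q}_{j+1} = q_{j+1}, \qquad p_{j+1} \parallel A q_{j+1}. $$

Thus $p_{j+1} = A q_{j+1} / \|A q_{j+1}\|$ and the result holds for $k = j + 1$. By the principle of induction, if the
algorithms do not break down then the theorem holds for $k = 1, 2, \ldots, n$. □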
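The theorem can be observed numerically for a few steps. The sketch below is illustrative only: the function names, matrices, and starting vector are assumptions, both loops omit breakdown checks and reorthogonalization, and the off-diagonal coefficients are obtained from the residual norms (so that $\beta_{k+1}/\omega_{k+1} = \|r\|$ and $\gamma_{k+1}/\omega_{k+1} = \|s\|$), which is one possible way to realize the normalization $\|p_k\| = \|q_k\| = 1$.

```python
import numpy as np

def two_sided_lanczos(F, p1, q1, m):
    """Two-sided recurrence with ||p_k|| = ||q_k|| = 1 and omega_k = p_k' q_k."""
    n = q1.size
    P, Q = np.zeros((n, m)), np.zeros((n, m))
    p, q = p1 / np.linalg.norm(p1), q1 / np.linalg.norm(q1)
    p_old, q_old = np.zeros(n), np.zeros(n)
    beta = gamma = 0.0
    omega_old = 1.0
    for k in range(m):
        P[:, k], Q[:, k] = p, q
        omega = p @ q
        alpha = p @ F @ q
        r = F @ q - (alpha / omega) * q - (gamma / omega_old) * q_old
        s = F.T @ p - (alpha / omega) * p - (beta / omega_old) * p_old
        nr, ns = np.linalg.norm(r), np.linalg.norm(s)
        p_old, q_old, omega_old = p, q, omega
        p, q = s / ns, r / nr
        beta, gamma = nr * (p @ q), ns * (p @ q)
    return P, Q

def pseudo_symmetric_lanczos(F, A, q1, m):
    """Recurrence (4.9) with unit-length vectors and omega_k = q_k' A q_k."""
    n = q1.size
    Q = np.zeros((n, m))
    q, q_old = q1 / np.linalg.norm(q1), np.zeros(n)
    beta, omega_old = 0.0, 1.0
    for k in range(m):
        Q[:, k] = q
        omega = q @ A @ q
        alpha = q @ A @ F @ q
        r = F @ q - (alpha / omega) * q - (beta / omega_old) * q_old
        nr = np.linalg.norm(r)
        q_old, omega_old = q, omega
        q = r / nr
        beta = nr * (q @ A @ q)
    return Q

# Illustrative data (assumed values).
M = np.diag([1.0, 2.0, 3.0])
K = np.array([[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]])
C = 0.1 * K + 0.05 * M
Z = np.zeros((3, 3))
A = np.block([[C, M], [M, Z]])
B = np.block([[-K, Z], [Z, M]])
F = np.linalg.solve(B, A)

q1 = np.arange(1.0, 7.0)
P, Q = two_sided_lanczos(F, A @ q1, q1, 4)       # p_1 is A q_1, normalized inside
Q_hat = pseudo_symmetric_lanczos(F, A, q1, 4)

AQ = A @ Q
AQ = AQ / np.linalg.norm(AQ, axis=0)             # columns A q_k / ||A q_k||
q_gap = np.max(np.abs(Q_hat - Q))
p_gap = np.max(np.abs(P - AQ))
```

Both gaps should be at rounding level for a short run, which shows why the $p_k$ need not be stored.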

5.  MOMENT  MATRICES  AND  HYPERBOLIC  PAIRS 

The breakdown of the (generalized) form of the Lanczos algorithm may be interpreted as
breakdown in the triangular factorization of a certain matrix. This connection is well known (see
[Householder, 1964] or [Parlett, Taylor, and Liu, 1985]) and gives valuable insight. We review it now. Let

$$ K_j = [\, q_1,\; B^{-1} A q_1,\; (B^{-1} A)^2 q_1,\; \ldots,\; (B^{-1} A)^{j-1} q_1 \,]. $$

This is called a Krylov matrix. When successful the Lanczos algorithm constructs a matrix

$$ Q_j = [\, q_1, \ldots, q_j \,] $$

of Lanczos vectors which are $A$-orthogonal, i.e.

$$ Q_j' A Q_j = \Omega_j = \text{diagonal}. $$

Moreover $Q_j$ and $K_j$ are connected by

$$ K_j = Q_j L_j' $$

where $L_j$ is an invertible lower triangular matrix. In other words $Q_j$ is obtained from $K_j$ by the
Gram-Schmidt process. However, Lanczos found a clever way to find $q_j$ without actually invoking
Gram-Schmidt. This result is not obvious; it expresses the fact that $B^{-1} A q_m$ is a linear combination of $q_{m-1}$,
$q_m$, and $q_{m+1}$ and from this it can be deduced that for each $m = 1, 2, \ldots, j$, $(B^{-1} A)^m q_1$ is a linear
combination of $q_1, q_2, \ldots, q_{m+1}$ only.

In the present context the moment matrix is defined as

$$ M_j = [\, m_{ik} \,], \qquad m_{ik} = (q_1, (B^{-1} A)^{i+k-2} q_1)_A = q_1' A (B^{-1} A)^{i+k-2} q_1. $$

A little manipulation shows that

$$ M_j = K_j' A K_j. $$

Consequently, if the Lanczos process does not break down,

$$ M_j = L_j Q_j' A Q_j L_j' = L_j \Omega_j L_j'. $$

This is the triangular factorization of $M_j$. We do not insist that the diagonal elements of $L_j$ be 1.
Triangular factorization exists if all the leading principal submatrices of $M_j$ are nonsingular (i.e.
invertible). Conversely, if the last diagonal element of $\Omega_j$ is the first to vanish then the Lanczos algorithm
breaks down at the end of step $j$.

Observe that $K_j$ and $Q_j$ are each $n \times j$ matrices whereas $L_j$ is invertible. Thus $K_j$ has full rank if
and only if $Q_j$ has full rank. It is desirable (though not essential) that $Q_j$ have full rank.

Lemma. Let $K_j$, $Q_j$, $M_j$, and $A$ be as defined above. If $K_j$ has full rank and $A$ is symmetric,
positive definite then so is $M_j$ and the factorization $L_j \Omega_j L_j'$ exists with $\Omega_j$ positive definite.

Proof: For any $v \in \mathbb{R}^j$, put $w = K_j v$. Then

$$ v' M_j v = v' K_j' A K_j v = w' A w > 0, $$

unless $w = 0$. If $K_j$ has full rank then $w = 0$ implies $v = 0$. Hence $M_j$ is positive definite. The leading
principal submatrices of a positive definite matrix are positive definite and thus $M_j$ permits a triangular
factorization. □
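The identity $M_j = K_j' A K_j$ and the Hankel structure of the moment matrix are easy to confirm on a small case. In this sketch the matrices and the starting vector are assumed values, and the leading principal minors are printed-quality diagnostics only: a vanishing minor would signal breakdown of the triangular factorization and hence of the Lanczos process.

```python
import numpy as np

# Illustrative data (assumed values).
M = np.diag([1.0, 2.0, 3.0])
K = np.array([[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]])
C = 0.1 * K + 0.05 * M
Z = np.zeros((3, 3))
A = np.block([[C, M], [M, Z]])
B = np.block([[-K, Z], [Z, M]])
F = np.linalg.solve(B, A)                       # F = B^{-1} A

q1 = np.arange(1.0, 7.0)
q1 = q1 / np.linalg.norm(q1)
j = 4

# Krylov matrix K_j = [q1, F q1, ..., F^{j-1} q1].
cols = [q1]
for _ in range(j - 1):
    cols.append(F @ cols[-1])
krylov = np.column_stack(cols)

# Moments q1' A F^p q1 for p = 0, ..., 2j-2; entry (i, k) uses p = i + k
# with zero-based indices, i.e. i + k - 2 with the one-based indices of the text.
moments = [q1 @ A @ np.linalg.matrix_power(F, p) @ q1 for p in range(2 * j - 1)]
moment_matrix = np.array([[moments[i + k] for k in range(j)] for i in range(j)])

matches_gram = np.allclose(moment_matrix, krylov.T @ A @ krylov)
leading_minors = [np.linalg.det(moment_matrix[:i, :i]) for i in range(1, j + 1)]
```

Forming powers of $F$ explicitly is done here only for clarity; it is numerically unattractive for long runs.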

When $A$ is indefinite then there exist vectors $w \neq 0$ such that $w' A w \le 0$ and, except in trivial cases,
there will be starting vectors $q_1$ and $j$-vectors $v$ such that the $A$-norm of $K_j v$ vanishes for large enough
$j$.

Definition: $v$ is isotropic if $v' A v = (v, v)_A = 0$.

It is helpful to regard the occurrence of breakdown (i.e. when $q_j$ is isotropic) not as disaster but as a
reminder that $A$ is indefinite. There is a natural way to proceed. The idea goes back to Lagrange but was
put to use by D. G. Luenberger to extend the conjugate gradient algorithm [Luenberger, 1969] and by
[Bunch, Kauffman, and Parlett, 1976] to stabilize triangular factorization.

By a simple rotation of coordinates the hyperbola $x^2 - y^2 = 1$ may be written as $2 \xi \eta = 1$. In our
context the idea is to modify our generalized Lanczos algorithm as follows. If $q_j$ is an isotropic vector then
choose the value of $\alpha_{j+1}$ in (4.1) or (4.9) so that $q_{j+1}$ is also isotropic. Provided that $q_j' A q_{j+1}$ is not too
small these Lanczos vectors still provide a good basis. They are no longer $A$-orthogonal. Nevertheless
$Q_{j+1}' A Q_{j+1}$ is block diagonal with $2 \times 2$ diagonal blocks of the form

$$ \begin{bmatrix} 0 & \omega_{j,j+1} \\ \omega_{j,j+1} & 0 \end{bmatrix} $$

whenever the modification is used. We follow Luenberger [Luenberger, 1969] in calling $q_j$, $q_{j+1}$ a
hyperbolic pair.

In practice there is no need to wait until $q_j' A q_j$ is negligible before switching to a hyperbolic pair
for $q_j$ and $q_{j+1}$. By forsaking strict $A$-orthogonality we gain a better conditioned basis. Similar thinking
for the two sided algorithm yields the Look Ahead algorithm [Parlett, Taylor, and Liu, 1985] but the
procedure is rather complicated. Our experience is limited but the only breakdowns we have encountered
were produced deliberately.
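The smallest possible hyperbolic pair lives in two dimensions. In this sketch (assumed matrix `A = diag(1, -1)`, not taken from the report) both basis vectors are isotropic, yet they are linearly independent and their Gram matrix in the pseudo inner product is the $2 \times 2$ block with zero diagonal described above.

```python
import numpy as np

A = np.diag([1.0, -1.0])                    # indefinite (assumed example)
q1 = np.array([1.0, 1.0]) / np.sqrt(2.0)    # isotropic: q1' A q1 = 0
q2 = np.array([1.0, -1.0]) / np.sqrt(2.0)   # isotropic: q2' A q2 = 0

Q = np.column_stack([q1, q2])
gram = Q.T @ A @ Q                          # zero diagonal, off-diagonal q1' A q2
rank = np.linalg.matrix_rank(Q)
condition = np.linalg.cond(Q)               # the pair is a well conditioned basis
```

Here the off-diagonal entry $q_1' A q_2$ equals 1, so the pair is as far from degenerate as possible.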


6.  LOSS  OF  ORTHOGONALITY 

It is well known that the three term recurrence does not produce $A$-orthogonal Lanczos vectors in
finite precision arithmetic. It is the convergence of the algorithm that provokes this deterioration, not
cancellation. There are three ways to respond to the situation.

1.  Do  nothing. 

This  technique  causes  Lanczos  sequences  to  be  2  or  3  times  larger  than  necessary.  It  is  only  of 
interest  when  the  whole  spectrum  is  wanted.  The  loss  of  orthogonality  does  not  prevent  the  calculation  of 
fully  accurate  eigenvalues  and  eigenvectors.  It  merely  slows  down  the  process. 

2.  Full  reorthogonalization  at  each  step. 

Here the vector computed by the three term recurrence is explicitly orthogonalized against all
preceding Lanczos vectors. This requires keeping the auxiliary vectors $A q_k$ in fast memory
(unless virtual memory is in use) and using them every step. However for short runs of 20 or 30 Lanczos
steps the cost is not excessive.

3.  Selective  reorthogonalization. 

Most  of  the  benefits  derived  from  a  set  of  Lanczos  vectors  that  is  A-orthogonal  to  working  accuracy 
are  also  enjoyed  when  the  vectors  are  merely  semi-orthogonal. 

Definition. Let $\epsilon$ be the roundoff unit. Then two vectors $u$, $v$ are semi-orthogonal with respect to $A$
if

$$ |u' A v| \le \sqrt{\epsilon}\, \|u\| \cdot \|v\| \cdot \|A\|. $$
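The definition amounts to a one-line test. The sketch below is a direct transcription under the assumption that $\epsilon$ is the double precision machine epsilon and $\|A\|$ is the spectral norm; the function name `semi_orthogonal` and the sample vectors are hypothetical.

```python
import numpy as np

def semi_orthogonal(u, v, A):
    """True if |u' A v| <= sqrt(eps) * ||u|| * ||v|| * ||A|| (spectral norm)."""
    eps = np.finfo(float).eps
    bound = (np.sqrt(eps) * np.linalg.norm(u) * np.linalg.norm(v)
             * np.linalg.norm(A, 2))
    return abs(u @ A @ v) <= bound

A = np.diag([1.0, -1.0, 2.0])                  # assumed example
u = np.array([1.0, 0.0, 0.0])
v_good = np.array([1e-10, 1.0, 0.0])           # u' A v = 1e-10, below the bound
v_bad = np.array([1e-4, 1.0, 0.0])             # u' A v = 1e-4, above the bound

good = semi_orthogonal(u, v_good, A)
bad = semi_orthogonal(u, v_bad, A)
```

With double precision the bound is roughly $1.5 \times 10^{-8}$ times the product of the norms, which is far looser than orthogonality to working accuracy.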

Two techniques have been proposed for maintaining semi-orthogonality. In [Parlett and Scott, 1979]
it is shown that computed vectors tend to be pulled towards converged Ritz vectors. Hence it helps to
orthogonalize the current Lanczos vectors against such vectors from time to time. This remedy only
addresses one mechanism that provokes orthogonality loss.

In [Simon, 1984] a recurrence was found that governs the inner products among the Lanczos vectors
themselves. This recurrence permits an accurate estimate of the orthogonality loss in the current Lanczos
vector to be computed at each step at a cost proportional to the number of Lanczos steps taken. It is then
easy to orthogonalize the new vector against all the old ones when, and only when, necessary in order to
maintain semi-orthogonality.

The recurrence extends easily to cover $A$-orthogonality and our algorithm incorporates this form of
selective orthogonalization. As a rule of thumb selective orthogonalization has a cost about 1/3 the cost
of full reorthogonalization. For large problems on some computer systems the I/O costs of
reorthogonalization dominate the arithmetic costs.

A  full  discussion  of  semi-orthogonality  and  how  to  maintain  it  is  given  in  [Parlett,  Nour-Omid,  and 
Liu,  1988]. 


7.  DANGERS 

The previous sections show that the symmetric quadratic eigenvalue problem

$$ (\lambda^2 M + \lambda C + K) u = 0 \tag{5.1} $$

may be reduced to the simpler form

$$ \left( T - \frac{1}{\lambda - \sigma}\, \Omega \right) u = 0 \tag{5.2} $$

where $T$ is real symmetric and tridiagonal and $\Omega$ is real diagonal (or block diagonal).

Any difficulties inherent in the original problem must be inherited by the reduced problem.
Important features are:

(1) Complex eigenvalues are present and are wanted.

(2) Sometimes (depending on $C$) an eigenvalue $\lambda$ may be degenerate; it may belong to a nondiagonal
Jordan block of $\Omega^{-1} T$. This can happen when $\lambda$ is a double real eigenvalue (the coalescence of a
conjugate pair of eigenvalues) or when both $\lambda$ and $\bar{\lambda}$ are double eigenvalues.

We consider (1). Since $T$ and $\Omega$ are real it is desirable to postpone the use of complex arithmetic.
Indeed if the HR algorithm [Bunse-Gerstner, 1981; Brebner and Grad, 1982] is used then the pair $(T, \Omega)$
is eventually transformed into $(\tilde{T}, \Delta)$ where

$$ \Delta = \operatorname{diag}(\pm 1, \pm 1, \ldots, \pm 1) $$

and $\tilde{T}$ is block diagonal with $2 \times 2$ and $1 \times 1$ blocks. Each complex conjugate pair of eigenvalues is found
from a real $2 \times 2$ pencil. No complex arithmetic is needed in this case but the HR algorithm is not always
stable. However there are alternative techniques for finding some or all of the eigenvalues of $(T, \Omega)$ and
exploiting the banded form. See Section 8.


Thus  (1)  is  not  a  serious  difficulty. 

Now consider (2): degenerate multiple eigenvalues. Theorem 15-2-1 in [Parlett, 1980] states that
any real square matrix may be written in the form $B^{-1} A$ where $A$ and $B$ are real symmetric.
Consequently a symmetric indefinite pair $(A, B)$ may suffer from highly defective eigenvalues. A simple
example is

$$\begin{pmatrix} 0 & 0 & 1 & a \\ 0 & 1 & a & 0 \\ 1 & a & 0 & 0 \\ a & 0 & 0 & 0 \end{pmatrix} - \lambda \begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{pmatrix}$$

which  is  a  Jordan  block  in  disguise. 
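
A quick numerical check makes the disguise visible. The sketch below is plain Python; the helper names `matmul` and `disguised_pair` and the value a = 2 are illustrative choices, not part of the report. It forms $B^{-1} A$ using the fact that the reversal matrix $B$ is its own inverse, and confirms that the result is a single Jordan block of order 4 with eigenvalue $a$.

```python
def matmul(X, Y):
    """Product of two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def disguised_pair(a):
    """Return the real symmetric pair (A, B) of the example for a given a."""
    A = [[0, 0, 1, a],
         [0, 1, a, 0],
         [1, a, 0, 0],
         [a, 0, 0, 0]]
    B = [[0, 0, 0, 1],
         [0, 0, 1, 0],
         [0, 1, 0, 0],
         [1, 0, 0, 0]]
    return A, B

a = 2  # arbitrary illustrative value
A, B = disguised_pair(a)
F = matmul(B, A)  # B is the reversal matrix, so B^{-1} = B and F = B^{-1} A

# N = F - a*I is nilpotent of index exactly 4: one Jordan block of order 4.
N = [[F[i][j] - (a if i == j else 0) for j in range(4)] for i in range(4)]
N2 = matmul(N, N)
N3 = matmul(N2, N)
N4 = matmul(N3, N)

for row in F:
    print(row)
```

Both $A$ and $B$ are symmetric, yet the printed matrix is lower bidiagonal with $a$ on the diagonal and ones below it, so only one eigenvector exists for the fourfold eigenvalue $a$.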

If $T$ has some off diagonal elements that vanish then the reduced eigenvalue problem splits up into
smaller pieces. Each piece consists of a $T$ with all its $\beta_i$ values nonzero.

Lemma: If no off diagonal entry $\beta_i$ of $T$ vanishes then to each eigenvalue of $T - (\lambda - \sigma)^{-1} \Omega$ there
corresponds exactly one eigendirection.

Proof: The minor of the $(1, j)$ entry is $\beta_1 \beta_2 \cdots \beta_{j-1} \ne 0$ for all $\lambda$ and so the rank never drops below $j-1$.

□ 
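
The key step of the proof can be checked directly. The following is a minimal sketch in plain Python; the function names and the sample values of $\alpha_i$, $\beta_i$, $\omega_i$ are assumptions made for illustration. Deleting the first row and last column of a tridiagonal pencil leaves an upper triangular matrix whose diagonal holds the off diagonal entries, so the minor equals their product whatever the shift.

```python
def det(M):
    """Determinant by Laplace expansion along the first row (fine for tiny matrices)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for c, entry in enumerate(M[0]):
        sub = [row[:c] + row[c + 1:] for row in M[1:]]
        total += (-1) ** c * entry * det(sub)
    return total

def pencil(alpha, beta, omega, mu):
    """Tridiagonal T - mu * Omega with diagonal alpha, off diagonal beta, Omega = diag(omega)."""
    j = len(alpha)
    M = [[0] * j for _ in range(j)]
    for i in range(j):
        M[i][i] = alpha[i] - mu * omega[i]
        if i + 1 < j:
            M[i][i + 1] = M[i + 1][i] = beta[i]
    return M

def corner_minor(M):
    """Minor of the (1, j) entry: delete the first row and the last column."""
    return det([row[:-1] for row in M[1:]])

alpha = [1, 2, 3, 4]      # hypothetical diagonal of T
beta = [2, 3, 5]          # hypothetical nonzero off diagonal entries
omega = [1, -1, 1, -1]    # hypothetical indefinite diagonal Omega

minors = [corner_minor(pencil(alpha, beta, omega, mu)) for mu in (-3, 0, 1, 7)]
print(minors)  # every entry equals 2 * 3 * 5
```

Because a $(j-1) \times (j-1)$ minor never vanishes, the pencil has rank at least $j-1$ at every shift and each eigenvalue has a one dimensional eigenspace.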

However  it  is  still  possible  to  have  eigenvalues  of  high  multiplicity. 

This result is to be expected. Whatever the geometric multiplicity of an eigenvalue of $A - \lambda B$, the
Lanczos algorithm can only "see" the projection of the starting vector $q_1$ onto the invariant subspace
associated with the eigenvalue. Thus, in exact arithmetic, it is possible for $T - \lambda \Omega$ to have generalized
eigenvectors with lower grades than the true multiplicity of the associated eigenvalues. It is exactly the same
in the truly symmetric case. The Lanczos algorithm cannot "see" geometric multiplicities.

Unfortunately it is the presence of degenerate eigenvalues that causes breakdown of the HR
algorithm.


Example of a multiple eigenvalue: the pencil

$$\begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix} - \lambda \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

has 0 as a double eigenvalue with eigenvector $(1, -1)'$ and any vector other than the eigenvector is a
generalized eigenvector of grade 2. There is no simpler symmetric representation of such an eigenvalue.
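
The claims in this example can be verified by hand or with a few lines of code. The sketch below assumes the pencil is $T - \lambda \Omega$ with $T$ the 2x2 matrix of ones and $\Omega = \mathrm{diag}(1, -1)$; that choice of $\Omega$, and the helper names, are assumptions made for illustration.

```python
T = [[1, 1], [1, 1]]
omega = [1, -1]  # assumed signature matrix Omega = diag(1, -1)

def char_poly(lam):
    """det(T - lam * Omega) for the 2x2 pencil."""
    return (T[0][0] - lam * omega[0]) * (T[1][1] - lam * omega[1]) - T[0][1] * T[1][0]

# N = Omega^{-1} T; the entries of Omega are +1 or -1, so dividing equals multiplying.
N = [[omega[i] * T[i][j] for j in range(2)] for i in range(2)]

def apply(M, v):
    """Matrix-vector product for a 2x2 matrix."""
    return [M[0][0] * v[0] + M[0][1] * v[1], M[1][0] * v[0] + M[1][1] * v[1]]

eigvec = [1, -1]
other = [3, 5]  # any vector that is not a multiple of eigvec

print(char_poly(3))                 # equals -(3**2): the polynomial is -lam^2
print(apply(N, eigvec))             # the eigenvector is annihilated
print(apply(N, other))              # a nonzero multiple of the eigenvector
print(apply(N, apply(N, other)))    # annihilated at the second application
```

The characteristic polynomial is $-\lambda^2$, so 0 is a double root, while $\Omega^{-1} T$ is nonzero and squares to zero, which is exactly a grade 2 generalized eigenvector structure.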

We  are  studying  the  effect  of  multiple  eigenvalues  on  the  performance  of  the  Lanczos  algorithm. 

There  is  another  difficulty  more  insidious  than  the  first  two. 

(3) Undetected growth of Lanczos vectors in certain directions. The following example was given by Dr.
T. Ericsson [private communication].

Example:

$$A = \mathrm{diag}(1, -1, x, x, \ldots, x),$$

$$B = \mathrm{diag}(-1, 1, x, x, \ldots, x),$$

$$q_1 = (\sigma, \sigma, x, x, \ldots, x)'.$$

Then $(q_1, q_1)_A$ is independent of $\sigma$ and consequently both $T$ and $\Omega$ will be independent of $\sigma$. The trouble
is that

$$z = (1, 1, 0, \ldots, 0)'$$

is an eigenvector of $B^{-1} A$ with eigenvalue $-1$ and $(z, z)_A = (z, z)_B = 0$. Arbitrarily large multiples of $z$
could be present in the Lanczos vectors (on account of $q_1$) and the Lanczos algorithm would be blind to
them. This possibility of undetectable growth in certain directions is a generalization of the phenomenon
reported in [Nour-Omid, Parlett, Ericsson, & Jensen, 1987] where the direction was in the null space of
$A$. Here it is the isotropic eigenvectors that are invisible.

It  remains  to  be  seen  whether  the  practice  of  keeping  all  Lanczos  vectors  with  Euclidean  length  1 
will  alleviate  the  problem  or  merely  drive  the  Lanczos  vectors  into  the  space  spanned  by  undetectable 
eigenvectors. 
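
The blindness in Ericsson's example can be reproduced with a small sketch. It assumes a 4x4 instance in which the unspecified entries $x$ are given arbitrary values, and it assumes a recurrence driven purely by $A$-inner products, so that the only data seen are the moments $(F^k q_1, q_1)_A$ with $F = B^{-1} A$. The names `moments` and `a_inner` are hypothetical.

```python
from fractions import Fraction

A = [1, -1, 2, 3]   # diag(1, -1, x, x); the tail values 2, 3 are arbitrary
B = [-1, 1, 1, 1]   # diag(-1, 1, x, x); the tail values 1, 1 are arbitrary
F = [Fraction(a, b) for a, b in zip(A, B)]  # diagonal of B^{-1} A

def a_inner(u, v):
    """Indefinite inner product (u, v)_A = u' A v for diagonal A."""
    return sum(a * x * y for a, x, y in zip(A, u, v))

def moments(sigma, count):
    """Return [(F^k q1, q1)_A for k < count] with q1 = (sigma, sigma, 1, 1)'."""
    q1 = [sigma, sigma, 1, 1]
    v = list(q1)
    out = []
    for _ in range(count):
        out.append(a_inner(v, q1))
        v = [f * x for f, x in zip(F, v)]
    return out

z = [1, 1, 0, 0]
Fz = [f * x for f, x in zip(F, z)]

print(moments(0, 4))
print(moments(10**6, 4))   # identical: sigma never shows up
print(a_inner(z, z), Fz)   # z is isotropic and F z = -z
```

The two moment lists agree exactly even though the second starting vector carries an enormous multiple of $z$, which is the sense in which the component along $z$ is invisible.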


8. HOW TO SOLVE $T - \lambda \Omega$

(1) Use EISPACK on $T_j$ (subroutines HQR or HQR2).

The tridiagonal form will be expanded to Hessenberg form by the QR algorithm. Thus no advantage
is taken of the compact tridiagonal form. The arithmetic effort is approximately $8 j^3$ to find all the
eigenvalues at step $j$.

(2)  The  HR  algorithm  [Bunse-Gerstner,  1981;  Brebner  and  Grad,  1982]. 
Let $\delta_i^2 = |\omega_i|$, $i = 1, \ldots, j$, and $\Delta = \mathrm{diag}(\delta_1, \ldots, \delta_j)$. The pencil $(T, \Omega)$ is equivalent to
$(\Delta^{-1} T \Delta^{-1}, \hat{I})$ where $\hat{I} = \mathrm{diag}(\omega_i / |\omega_i|)$, $i = 1, \ldots, j$. The HR algorithm is a generalization of the
symmetric tridiagonal QR algorithm. However hyperbolic rotations of the form

$$\begin{pmatrix} \cosh\theta & \sinh\theta \\ \sinh\theta & \cosh\theta \end{pmatrix} \quad \text{or} \quad \begin{pmatrix} \sec\theta & \tan\theta \\ \tan\theta & \sec\theta \end{pmatrix}$$

are used in place of plane rotations whenever the associated diagonal elements of $\hat{I}$ have opposite signs.
Complex shifts may be used without invoking complex arithmetic in the same way as they are used in the
unsymmetric QR algorithm.
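
The property that makes these matrices the natural replacement for plane rotations is that they preserve the signature form $\mathrm{diag}(1, -1)$, just as a plane rotation preserves the identity. A minimal sketch in plain Python follows; the function names and the angle 0.7 are illustrative assumptions.

```python
import math

def hyperbolic_rotation(theta):
    """The cosh/sinh form of a 2x2 hyperbolic rotation."""
    c, s = math.cosh(theta), math.sinh(theta)
    return [[c, s], [s, c]]

def sec_tan_rotation(theta):
    """The sec/tan form; valid when cos(theta) is nonzero."""
    c, s = 1.0 / math.cos(theta), math.tan(theta)
    return [[c, s], [s, c]]

def signature_form(H):
    """Return H^T J H for J = diag(1, -1)."""
    J = [1.0, -1.0]
    return [[sum(H[k][i] * J[k] * H[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

theta = 0.7  # arbitrary illustrative angle
for H in (hyperbolic_rotation(theta), sec_tan_rotation(theta)):
    G = signature_form(H)
    print([[round(g, 12) for g in row] for row in G])

# Unlike a plane rotation, a hyperbolic rotation is not norm preserving:
# the largest singular value of the cosh/sinh form is cosh + sinh = exp(theta).
H = hyperbolic_rotation(theta)
growth = H[0][0] + H[0][1]
print(growth, math.exp(theta))
```

The unbounded growth of $\cosh\theta + \sinh\theta$ as $\theta$ increases is one way to see why a transformation of this kind can become unstable.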

The only weakness is that the HR algorithm can break down for certain shift values and can also be
unstable when the shift is close to breakdown. In addition it will find all the eigenvalues at each step.
This is overkill. When the algorithm succeeds it needs approximately $10 j^2$ arithmetic operations for a
$j \times j$ pair $(T, \Omega)$.

(3) The LR algorithm applied to $\Omega^{-1} T_j$.

This  is  closely  related  to  the  HR  algorithm.  It  is  efficient,  finds  all  the  eigenvalues  and  can  break 
down. 

(4)  ANALYZE! 

This application of Laguerre's algorithm was developed for use with the two-sided Lanczos
algorithm. A data structure consisting of some eigenvalues of the pencil $(T_{j-1}, \Omega_{j-1})$ together with their error
estimates is updated to produce some eigenvalues of $(T_j, \Omega_j)$ and their error estimates. These estimates
are measures of how close the eigenvalue is to an eigenvalue of the original pair $(A, B - \sigma A)$.

To update one eigenvalue a sequence of Laguerre iterates is computed starting with an old one. The
calculation of the Laguerre iterate takes full advantage of the tridiagonal form but complex arithmetic is
used. The program is still under development. The difficulty is to detect and find new large eigenvalues
that are not close to any at the previous Lanczos step. One remedy is simply to find all eigenvalues of
$(T_j, \Omega_j)$ at each step.


9. ERROR BOUNDS

Let

$$(T_j - \theta\,\Omega_j)\,s = 0 ,$$  (9.1)

with $\|s\| = 1$, define a typical eigenpair $(\theta, s)$ of the reduced problem. Let $F$ be the operator given to the
Lanczos algorithm and let $F$ be symmetric with respect to $A$. After $j$ steps, in exact arithmetic,

$$F Q_j - Q_j \Omega_j^{-1} T_j = q_{j+1} (\beta_{j+1} / \omega_{j+1})\, e_j^* .$$  (9.2)

The 'Ritz vector' corresponding to $\theta$ in (9.1) is defined by

$$y := Q_j s .$$  (9.3)

To obtain an error bound for the approximate eigenpair $(\theta, y)$ one postmultiplies (9.2) by $s$ and uses (9.1)
to obtain

$$F y - y\,\theta = q_{j+1} (\beta_{j+1} / \omega_{j+1})\, s(j)$$  (9.4)

and

$$\|F y - y\,\theta\| = (\beta_{j+1} / \omega_{j+1})\, |s(j)|$$  (9.5)

since $\|q_{j+1}\| = 1$. Unfortunately, it is not possible to evaluate $\|y\|$ without computing $y$. Note that
$y' A y = s' \Omega_j s$ but this quantity might be 0 or negative in general.

When $A$ is positive definite then it defines a proper norm on $\mathbb{R}^n$ via

$$\|v\|_A = (v' A v)^{1/2} .$$

In that case $\|q_i\|_A = \sqrt{\omega_i}$, $i = 1, \ldots, j$, and

$$\|F y - y\,\theta\|_A = (\beta_{j+1} / \sqrt{\omega_{j+1}})\, |s(j)| ,$$

$$\|y\|_A = (s' \Omega_j s)^{1/2} .$$  (9.6)

The following error bound extends well known results for symmetric matrices. Consequently (9.6) yields
error bounds without the need for computing with vectors in $\mathbb{R}^n$.

Theorem A. If $F$ is symmetric with respect to a symmetric positive definite $A$ then, for any
$y \in \mathbb{R}^n \setminus 0$ and $\theta \in \mathbb{R}$, there is an eigenvalue $\lambda$ of $F$ satisfying

$$|\lambda - \theta| \le \|F y - y\,\theta\|_A / \|y\|_A .$$


Proof:

$$\|y\|_A = \|(F - \theta)^{-1} (F - \theta)\, y\|_A \le \|(F - \theta)^{-1}\|_A \, \|F y - y\,\theta\|_A = \|F y - y\,\theta\|_A / \min_\lambda |\lambda - \theta| \qquad \Box$$

This result depends strongly on the existence of a full set of $A$-orthogonal eigenvectors for $F$ and is
not valid, or necessarily meaningful, when $A$ is indefinite. Thus we cannot use (9.6) just as it is.
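
Theorem A is easy to exercise on a 2x2 case. The sketch below is an illustration under stated assumptions: $A$ is a positive diagonal matrix, $F = A^{-1} S$ with $S$ symmetric (which makes $F$ symmetric with respect to $A$), and the function name `theorem_a_check` and all numerical values are hypothetical.

```python
import math

def a_norm(a, v):
    """Norm induced by the positive definite diagonal matrix A = diag(a)."""
    return math.sqrt(sum(ai * vi * vi for ai, vi in zip(a, v)))

def theorem_a_check(a, S, y, theta):
    """Return (distance from theta to the nearest eigenvalue of F, residual bound)."""
    F = [[S[i][k] / a[i] for k in range(2)] for i in range(2)]  # F = A^{-1} S
    half_tr = 0.5 * (F[0][0] + F[1][1])
    det_f = F[0][0] * F[1][1] - F[0][1] * F[1][0]
    root = math.sqrt(half_tr * half_tr - det_f)  # real: F is similar to a symmetric matrix
    eigenvalues = (half_tr - root, half_tr + root)
    residual = [F[i][0] * y[0] + F[i][1] * y[1] - theta * y[i] for i in range(2)]
    bound = a_norm(a, residual) / a_norm(a, y)
    gap = min(abs(lam - theta) for lam in eigenvalues)
    return gap, bound

a = [2.0, 1.0]                    # A = diag(2, 1)
S = [[4.0, 1.0], [1.0, 3.0]]      # symmetric, so A F = S is symmetric
cases = [([1.0, 0.0], 2.0), ([1.0, 1.0], 0.0), ([1.0, -2.0], 3.0)]
results = [theorem_a_check(a, S, y, theta) for y, theta in cases]
for gap, bound in results:
    print(f"gap = {gap:.4f}  bound = {bound:.4f}")
```

In every case the gap is no larger than the bound, as the theorem guarantees. No such guarantee survives when some entry of `a` is negative, since `a_norm` is then no longer a norm.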

For indefinite $A$ we may invoke the results presented in [Kahan, Parlett, and Jiang, 1982] and adapt
them to our problem. Two residual norms are required. Let

$$u := F y - y\,\theta , \qquad v^* := x^* F - \theta\, x^* ,$$  (9.7)

where $y^* x \ne 0$, $\|y\| = \|x\| = 1$. Now we are obliged to work in $\mathbb{C}^n$ rather than $\mathbb{R}^n$.

Theorem B. With the notation given above $(\theta, y, x^*)$ is an eigentriple of $F - E$ for some $E$
satisfying

$$\|E\| \le \max\{ \|u\|, \|v^*\| \} .$$

Note that $\|v^*\| = \|v\|$. In other words, if $\|u\|$ and $\|v\|$ are both tiny then $(\theta, y, x^*)$ is an eigentriple for an
operator almost indistinguishable from the given one.

In our case, if $F = B^{-1} A$ then $F^* = A B^{-1}$ and we have from (9.4),

$$B^{-1} A y - y\,\theta = q_{j+1} (\beta_{j+1} / \omega_{j+1})\, s(j) .$$

The second residual is obtained easily by premultiplying by $A$:

$$(A B^{-1}) (A y) - (A y)\,\theta = A q_{j+1} (\beta_{j+1} / \omega_{j+1})\, s(j) .$$

Let

$$\rho(j) = (\beta_{j+1} / \omega_{j+1})\, |s(j)| .$$  (9.8)

Then $(\theta, y, A y)$ is an eigentriple of $F - E$ for some $E$ satisfying

$$\|E\| \le \rho(j) \max\left\{ \frac{1}{\|y\|} , \frac{\|A q_{j+1}\|}{\|A y\|} \right\} .$$  (9.9)

When the computation has proceeded enough that $|\rho(j)|$ is very small then first order perturbation
theory may be invoked to obtain an accurate error estimate for $\theta$ by regarding $F$ as a change to $F - E$. If
$\lambda$ is the eigenvalue of $F$ closest to $\theta$ we have, in general,

$$|\lambda - \theta| = \frac{\|y\| \cdot \|x\|}{|y^* x|} \cdot \frac{|x^* E y|}{\|y\| \cdot \|x\|} + O(\|E\|^2) .$$  (9.10)

The first factor on the right is independent of $E$ and is the condition number of $\theta$ as an eigenvalue of $F - E$:

$$\mathrm{cond}(\theta) = \|y\| \cdot \|x\| / |y^* x| .$$  (9.11)

We now adapt these results to our problem. Note that

$$\|y\|^2 = |s^* Q_j^* Q_j s| \le j\,|s^* s| = j$$

and

$$|y^* A y| = |s^* Q_j^* A Q_j s| = |s^* \Omega_j s| .$$

Now we can obtain an error bound. As $\|E\| \to 0$, using (9.9),

$$|\lambda - \theta| \le \mathrm{cond}(\theta) \cdot \|E\| + O(\|E\|^2)$$

$$\le \frac{\|y\| \cdot \|A y\|}{|y^* A y|}\, \rho(j) \max\left\{ \frac{1}{\|y\|} , \frac{\|A q_{j+1}\|}{\|A y\|} \right\} + O(\|E\|^2)$$

$$= \frac{\rho(j)}{|s^* \Omega_j s|} \max\{ \|A y\| , \|A q_{j+1}\| \cdot \|y\| \} + O(\|E\|^2)$$

$$\le \frac{\rho(j) \sqrt{j}}{|s^* \Omega_j s|}\, \|A\| + O(\|E\|^2) .$$  (9.12)

The attraction of the last inequality is that the dominant term can be calculated at step $j$, without recourse
to $n$-vectors, provided that $\|A\|$ is supplied along with the subroutine that multiplies vectors by $A$. It is
useful to compare (9.12) with (9.6) and Theorem A.


We use $\rho(j) / |s' \Omega_j s|$ as a provisional error estimate. When the required number of Ritz values $\theta$
have passed this test then, and not before, the Ritz vector $y$ may be computed. At that point the more
precise factor

$$\max\{ \|A y\| , \|A q_{j+1}\| \cdot \|y\| \}$$

may be computed at the cost of forming $A y$.

At the end of a Lanczos run a multiple of $A q_{j+1}$ is available and its Euclidean norm can be
computed at the cost of one dot product. No extra call on $A$ is necessary for that term.
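
The provisional estimate needs only quantities of length $j$. A minimal sketch follows; the function name `provisional_estimate`, the use of an absolute value on $\beta_{j+1}/\omega_{j+1}$ (so that a negative $\omega_{j+1}$ still gives a nonnegative estimate), the restriction to real $s$, and the sample numbers are all assumptions made for illustration.

```python
import math

def provisional_estimate(beta_next, omega_next, s, omega):
    """rho(j) / |s' Omega_j s| for a real eigenvector s of the reduced pencil.

    beta_next, omega_next : beta_{j+1} and omega_{j+1} from the Lanczos recurrence
    s                     : eigenvector of (T_j, Omega_j), Euclidean length 1
    omega                 : diagonal of Omega_j
    """
    rho = abs(beta_next / omega_next) * abs(s[-1])   # (9.8), using the last entry s(j)
    denom = abs(sum(w * si * si for w, si in zip(omega, s)))
    if denom == 0.0:
        return math.inf   # s is isotropic with respect to Omega_j: no estimate available
    return rho / denom

# Hypothetical data for a 2-step run with an indefinite Omega_2 = diag(1, -1).
print(provisional_estimate(0.5, -2.0, [0.6, 0.8], [1.0, -1.0]))

# An isotropic s makes the denominator vanish, the case flagged after (9.5).
r = 1.0 / math.sqrt(2.0)
print(provisional_estimate(0.5, -2.0, [r, r], [1.0, -1.0]))
```

A small value of $|s' \Omega_j s|$ inflates the estimate, which matches the role this quantity plays as the reciprocal of the condition number in (9.11).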


10. NUMERICAL EXAMPLES

In this section, we use several test problems to assess the performance of the proposed algorithm in
extracting the eigenpairs of damped dynamic systems. The mass, damping, and stiffness matrices of the
discretized systems are computed using FEAP, a Finite Element Analysis Program [Zienkiewicz, 1977,
Chapter 24]. The results reported herein were obtained on a VAXstation II/GPX computer system
running the Ultrix 1.2 operating system and the f77 Fortran compiler. To show the savings resulting from
the use of selective orthogonalization (SRO), we have added a full re-orthogonalization (FRO) option to
our program and present the comparisons in Table 1.

Test Problem 1: The structure is modeled as a cantilever beam with a lumped translational
viscous damper attached at the tip. The beam is modeled using elementary beam theory; the
geometrical configuration and physical properties are shown in Figure 2. The consistent mass matrix is used to
define $M$. The damping matrix $C$ has only one nonzero element representing the magnitude $c$ of the
lumped damper. The cantilever beam is divided into 20 equal elements and has 40 degrees of freedom.
The order of the associated $(A, B)$ is 80.

We  use  the  Lanczos  algorithm  with  the  FRO  scheme  to  solve  this  problem.  Figure  3  summarizes 
the  results  of  8  experiments.  Here,  we  call  a  Ritz  pair  good  if  the  pseudo  length  of  its  associated  residual 
vector  is  less  than  10~*.  This  criterion  ensures  that  a  good  Ritz  pair  approximates  the  wanted  eigenpair  to 
high accuracy. From the results in Figure 3, we see that the first few eigenpairs can be extracted at a
fairly  low  cost  compared  to  the  other  eigenpairs.  This  is  because  the  re-orthogonalization  cost  is  greater 
at  later  steps  in  the  Lanczos  algorithm. 

In this problem, we have run the algorithm to compute all eigenvalues in order to test the robustness
of the computer program developed. However we emphasize that the algorithm is intended only for
partial solution of a large eigenproblem. After the 80 steps, we see that the pseudo length of the 81st
Lanczos vector is $0.9 \times 10^{-15}$, which is at the round-off level, implying that the computed Lanczos vectors have
spanned the whole solution space as they should in exact arithmetic. This desirable result ensures that
all the Ritz pairs obtained from the solution of the reduced tridiagonal system are good and hence are
accurate eigenpairs of the system.

Test Problem 2: The system consists of two beams connected by a hinge with a rotational
viscous damper. The geometrical configuration and physical properties of the system are shown in
Figure 4. The consistent mass matrix is used for $M$. The damping matrix $C$ has only four nonzero elements,
which are due to the lumped rotational damper. The system is divided into 40 equal elements and has 83
degrees of freedom. The associated $(A, B)$ is of order 166. The system is unrestrained and has rigid body
modes, so we apply a shift to $(A, B)$ to compute the eigenpairs of this unrestrained system. The
Lanczos algorithm with the FRO scheme is used to solve this problem. Figure 5 summarizes the results of 9
experiments. Conclusions similar to those for the first test problem can be inferred from Figure 5. The pseudo
length of the 167th Lanczos vector is $0.1 \times 10^{-14}$, which again exhibits the robustness of the computer
program developed.

In general, the starting vector for the Lanczos algorithm may be chosen arbitrarily. However, if the
starting vector is orthogonal to any of the eigenvectors of $(A, B)$, all the Lanczos vectors will also be
orthogonal to these eigenvectors. In practice, round-off errors eventually will introduce components
along these eigenvectors; however, round-off enters slowly and the convergence to these eigenvectors is
deferred. Therefore, we need to avoid the possibility of the starting vector being orthogonal to the wanted
eigenvectors of the system. Since the structural system in this test problem is symmetric, there are
anti-symmetric modes as well as symmetric modes. If a symmetric starting vector is used, such as
$(1, 1, \ldots, 1)$, all the Lanczos vectors will be symmetric. Accordingly, all the anti-symmetric modes of
the structure will be suppressed by this biased starting vector. To obtain all the required lower modes, we
must avoid choosing either a symmetric or an anti-symmetric starting vector. This is a strong reason for
using a random vector to start.
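
The suppression of anti-symmetric modes can be seen on a toy operator. The sketch below does not use the matrices of Test Problem 2; it assumes a 5x5 second difference matrix as a stand-in for any operator that commutes with the left-right reflection of the structure. Every Lanczos vector is a combination of the Krylov vectors computed here, so the conclusion carries over to them.

```python
def apply(F, v):
    """Matrix-vector product with F stored as a list of rows."""
    return [sum(f * x for f, x in zip(row, v)) for row in F]

n = 5
# Stand-in operator: tridiag(-1, 2, -1), which is unchanged by reversing the node order.
F = [[2 if i == j else -1 if abs(i - j) == 1 else 0 for j in range(n)]
     for i in range(n)]

start = [1] * n                      # symmetric starting vector (1, 1, ..., 1)
krylov = [start]
for _ in range(3):
    krylov.append(apply(F, krylov[-1]))

anti_mode = [1, 1, 0, -1, -1]        # an anti-symmetric eigenvector of F (eigenvalue 1)

for v in krylov:
    overlap = sum(a * b for a, b in zip(anti_mode, v))
    print(v, "symmetric:", v == v[::-1], "overlap with anti-symmetric mode:", overlap)
```

Every Krylov vector reads the same forwards and backwards and has zero overlap with the anti-symmetric mode, so in exact arithmetic that mode can never be found from this start.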

Test  Problem  3  :  This  problem  is  a  three  dimensional  space  truss  system.  There  are  44  nodes  and 
the  4  end  nodes  are  fully  restrained,  as  shown  in  Figure  6.  Thus,  there  are  120  degrees  of  freedom  and  the 
associated  (A,  B)  is  of  order  240.  All  truss  bars  have  the  same  density  and  Young’s  modulus  but  different 
damping,  as  shown  in  Figure  6,  resulting  in  a  nonproportionally  damped  system.  We  use  the  Lanczos 
algorithm  with  the  FRO  scheme  to  generate  60  Lanczos  vectors.  We  also  use  the  Lanczos  algorithm  with 
the proposed SRO scheme to generate 60 Lanczos vectors. The results from the two schemes are
compared in Table 1. The SRO scheme is shown to be adequate to compute the desired solution.

Test Problem 4: This problem is a larger three dimensional space truss system. There are 300
nodes and the 4 end nodes are fully restrained, as shown in Figure 7. A typical cell is the same as the
typical cell in Test Problem 3. There are 888 degrees of freedom and the order of the associated $(A, B)$ is
1776. We use the Lanczos algorithm with the FRO scheme to generate 80 Lanczos vectors. We also use
the Lanczos algorithm with the SRO scheme to generate 80 Lanczos vectors. The results from the two
schemes are also compared in Table 1.

From Table 1, we see that the 60 Ritz pairs obtained provide 28 good eigenpairs for Test Problem 3
and the 80 Ritz pairs obtained provide 40 good eigenpairs for Test Problem 4, for both the FRO and SRO
cases. That is, approximately two Lanczos vectors, on the average, are required to capture a new
eigenvector for these two large problems. This implies that the Krylov subspace generated by $B^{-1} A$ and a
random vector is very effective in approximating the least dominant eigenvectors of the damped dynamic
systems considered.

By maintaining semi-orthogonality between the Lanczos vectors with the SRO scheme, the
resulting Ritz values are as accurate as those obtained with the FRO scheme, as shown in Table 1. But a great
part of the re-orthogonalization steps can be eliminated by using the SRO scheme instead of the FRO
scheme. That is, we can eliminate part of the re-orthogonalization effort without sacrificing accuracy of the
final solution when solving $\lambda A z = B z$ with the SRO scheme. This is in agreement with the case of
solving $\omega^2 M w = K w$ by a standard Lanczos algorithm with the SRO scheme.
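
The savings can be read off Table 1 with a little arithmetic. The sketch below simply re-enters the counts reported in Table 1; the dictionary layout and the function name `summarize` are illustrative assumptions.

```python
# Counts copied from Table 1 (numbers of Lanczos vectors, re-orthogonalizations, good pairs).
table1 = {
    "Test Problem 3": {"vectors": 60, "reorth_fro": 1770, "reorth_sro": 602, "good": 28},
    "Test Problem 4": {"vectors": 80, "reorth_fro": 3159, "reorth_sro": 1246, "good": 40},
}

def summarize(row):
    """Return (SRO/FRO re-orthogonalization ratio, Lanczos vectors per good Ritz pair)."""
    return row["reorth_sro"] / row["reorth_fro"], row["vectors"] / row["good"]

for name, row in table1.items():
    ratio, per_pair = summarize(row)
    print(f"{name}: SRO/FRO = {ratio:.2f}, vectors per good pair = {per_pair:.2f}")

# For comparison, orthogonalizing vector k against all k-1 earlier ones for k = 2..60
# takes 60*59/2 inner products, which matches the FRO count for Test Problem 3.
full_count_60 = 60 * 59 // 2
print(full_count_60)
```

The ratios of roughly 0.34 and 0.39 are in line with the rule of thumb quoted earlier that selective orthogonalization costs about one third of full re-orthogonalization.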

To assess the efficiency of the Lanczos algorithm, the lower mode solutions of the above four test
problems are also computed using a subspace iteration algorithm. The subspace iteration algorithm
reported in [Chen and Taylor, 1986] is used for this purpose. The subspace dimension is determined by
min(2n, n + 8), where n is the number of wanted eigenpairs. Table 2 compares the cost of the Lanczos
algorithm with the cost of the subspace iteration algorithm. It is apparent that the Lanczos algorithm is
considerably more efficient than the subspace iteration algorithm for the examples considered. More
sophisticated versions of subspace iteration might perform somewhat better than ours, but not
enough to alter the striking contrasts in Table 2.

Table 1  Results of Test Problems 3 and 4

                                                     Test Problem 3      Test Problem 4
item                                                 FRO       SRO       FRO       SRO
number of Lanczos vectors generated                  60        60        80        80
number of re-orthogonalizations                      1770      602       3159      1246
CPU time spent on generating Lanczos vectors         40.3      32.6      536.6     473.5
CPU time spent on solving reduced eigenproblem       50.8      51.7      100.9     101.2
total CPU time spent on solving the whole problem    113.9     106.7     893.2     830.9
number of good Ritz pairs obtained                   28        28        40        40

Breakdown occurs when $\omega_j = 0$ for some $j$. The algorithm provides a bad basis if there are any $\omega_j$
as small as $\sqrt{\varepsilon} = 10^{-8}$. In Figure 8 we plot $\mathrm{sign}(\omega_j) \cdot \log (1/|\omega_j|)$ against $j$. The result is typical for our
examples. Quite quickly $|\omega_j|$ drops to $10^{-3}$ but seems to stay at that level without deteriorating for 120
steps. We have no explanation of this phenomenon.
Table 2  Results from different algorithms

           Lanczos algorithm                  subspace iteration algorithm
Test       No. of good    CPU time            No. of good    CPU time
Problem    Ritz pairs     (second)            Ritz pairs     (second)
1          8              8.5                 8              40.2
2          20             41.7                16             214.6
3          28             113.9               24             1012.9
4          40             893.2               40             20992.8
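
The contrast in Table 2 is easiest to appreciate as a ratio of CPU times. The sketch below re-enters the Table 2 figures; the variable and function names are illustrative. Note that for Test Problems 2 and 3 the subspace iteration run delivered fewer good Ritz pairs than the Lanczos run, so the raw ratio understates the difference there.

```python
# (problem, Lanczos good pairs, Lanczos CPU s, subspace good pairs, subspace CPU s), from Table 2.
table2 = [
    (1, 8, 8.5, 8, 40.2),
    (2, 20, 41.7, 16, 214.6),
    (3, 28, 113.9, 24, 1012.9),
    (4, 40, 893.2, 40, 20992.8),
]

def cpu_ratio(row):
    """Subspace iteration CPU time divided by Lanczos CPU time."""
    _, _, lanczos_cpu, _, subspace_cpu = row
    return subspace_cpu / lanczos_cpu

ratios = {row[0]: cpu_ratio(row) for row in table2}
for problem, ratio in ratios.items():
    print(f"Test Problem {problem}: subspace / Lanczos CPU time = {ratio:.1f}")
```

The ratio grows from under 5 on the smallest problem to more than 23 on the largest, so the advantage of the Lanczos program widens with problem size in these runs.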
Figure 1  Outline of Lanczos inner loop with operator $H^{-1} A$

Step 1:                                                     Operation Count

  pick a starting vector r (default is random)
  p = A r                                                   $2\mu(M) + \mu(C)$
  solve H q = p                                             $\nu(K)$
  $\beta_1 = |q^T q|^{1/2}$                                 $2n$
  q <- q / $\beta_1$                                        $2n$
  p = A q                                                   $2\mu(M) + \mu(C)$
  $\omega_1 = (q^T p)$
  solve H r = p                                             $\nu(K)$
  $\alpha_1 = (r^T p)$                                      $2n$
  r <- r - q ($\alpha_1 / \omega_1$)                        $2n$
  oldp = A r                                                $2\mu(M) + \mu(C)$
  $\beta_2 = |r^T r|^{1/2}$
  $\omega_2 = (r^T \mathrm{oldp}) / |r^T r|$                $2n$
  store q as $q_1$

Loop: For j = 2, 3, ...                                     Operation Count

  oldq <-> q
  oldp <-> p
  q <- r / $\beta_j$                                        $2n$
  p <- p / $\beta_j$                                        $2n$
  solve H r = p                                             $\nu(K)$
  ($r^T \mathrm{oldp}$ should equal $\beta_j$)
  r <- r - oldq ($\beta_j / \omega_{j-1}$)                  $2n$
  $\alpha_j$ <- $r^T p$                                     $2n$
  r <- r - q ($\alpha_j / \omega_j$)                        $2n$
  oldp = A r                                                $2\mu(M) + \mu(C)$
  $\beta_{j+1} = |r^T r|^{1/2}$
  $\omega_{j+1} = (r^T \mathrm{oldp}) / |r^T r|$            $2n$
  (repeat to maintain local orthogonality)
  store q as $q_j$

Figure 3  Results of Test Problem 1 (axes: number of good Ritz pairs; CPU time in seconds)

Figure 4  Test Problem 2

Figure 5  Results of Test Problem 2

Figure 8  $\mathrm{sign}(\omega_j) \cdot \log (1/|\omega_j|)$ versus $j$


